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Abstract. Predicting the winner of an election is a favorite problem 
both for news media pundits and computational social choice theorists. 
Since it is often infeasible to elicit the preferences of all the voters in a 
typical prediction scenario, a common algorithm used for winner predic¬ 
tion is to run the election on a small sample of randomly chosen votes 
and output the winner as the prediction. We analyze the performance of 
this algorithm for many common voting rules. 

More formally, we introduce the [e, 5)-winner determination problem, 
where given an election on n voters and m candidates in which the mar¬ 
gin of victory is at least en votes, the goal is to determine the winner with 
probability at least 1 — 5. The margin of victory of an election is the small¬ 
est number of votes that need to be modified in order to change the elec¬ 
tion winner. We show interesting lower and upper bounds on the number 
of samples needed to solve the (e, 5)-winner determination problem for 
many common voting rules, including scoring rules, approval, maximin, 
Copeland, Bucklin, plurality with runoff, and single transferable vote. 
Moreover, the lower and upper bounds match for many common voting 
rules in a wide range of practically appealing scenarios. 


Keywords: Computational social choice, winner determination, voting, sam¬ 
pling, prediction, polling 


1. INTRODUCTION 


1 Introduction 

A common and natural way to aggregate preferences of agents is through an 
election. In a typical election, we have a set of candidates and a set of voters, and 
each voter reports his preference about the candidates in the form of a vote. We 
will assume that each vote is a ranking of all the candidates. A voting rule selects 
one candidate as the winner once all voters provide their votes. Determining the 
winner of an election is one of the most fundamental problems in social choice 
theory. 

In many situations, however, one wants to predict the winner without holding 
the election for the entire population of voters. The most immediate such exam¬ 
ple is an election poll. Here, the pollster wants to quickly gauge public opinion in 
order to predict the outcome of a full-scale election. For political elections, exit 
polls (polls conducted on voters after they have voted) are widely used by news 
media to predict the winner before official results are announced. In surveys, a 
full-scale election is never conducted, and the goal is to determine the winner, 
based on only a few sampled votes, for a hypothetical election on all the voters. 
For instance, it is not possible to force all the residents of a city to fill out an 
online survey to rank the local Chinese restaurants, and so only those voters 
who do participate have their preferences aggregated. 

If the result of the poll or the survey has to reflect the true election outcome, 
it is obviously necessary that the number of sampled votes not be too small. 
Here, we investigate this fundamental question: 

What is the minimum number of votes that need to be sampled so that 

the winner of the election on the sampled votes is the same as the winner 

of the election on all the votes? 

This question can be posed for any voting rule. The most immediate rule 
to study is the plurality voting rule, where each voter votes for a single candi¬ 
date and the candidate with most votes wins. Although the plurality rule is the 
most common voting rule used in political elections, it is important to extend 
the analysis to other popular voting rules. For example, the single transferable 
vote rule is used in political elections in Australia, India and Ireland, and it was 
the subject of a nationwide referendum in the UK in 2011. The Borda voting 
rule is used in the Icelandic parliamentary elections. Outside politics, in private 
companies and competitions, a wide variety of voting rules are used. For exam¬ 
ple, the approval voting rule has been used by the Mathematical Association of 
America, the American Statistical Institute, and the Institute of Electrical and 
Electronics Engineers, and Condorcet consistent voting rules are used by many 
free software organizations. Section 2 discusses the most common voting rules 
in use. 

Regardless of the voting rule, though, the question of hnding the minimum 
number of vote samples required becomes trivial if a single voter in the election 
can change the winning candidate. In this case, all the votes need to be counted, 
because otherwise that single crucial vote may not be sampled. We get around 
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this problem by assuming that in the elections we consider, the winning candi¬ 
date wins by a considerable margin of victory. Formally, the margin of victory 
for an election is defined as the minimum number of votes that must be changed 
in order to change the election winner. Note that the margin of victory depends 
not only on the votes cast but also on the voting rule used in the election. 

1.1 Our Contributions 

Let the number of voters be n and the number of candidates m. We introduce 
and study the following problem^: 

Definition 1. ({e,S)-winner determination) 

Given a voting rule and a set of n votes over a set of m candidates such that 
the margin of victory is at least en, determine the winner of the election with 
probability at least 1 — (5. (The probability is taken over the internal coin tosses 
of the algorithm.) 

We remind the reader that there is no assumption about the distribution 
of votes in this problem. Our goal is to solve the {e, (5)-winner determination 
problem by a randomized algorithm that is allowed to query the votes of arbitrary 
voters. Each query reveals the full vote of the voter. The minimum number 
of votes queried by any algorithm that solves the (e,5)-winner determination 
problem is termed the sample complexity. The sample complexity can of course 
depend on e, i5, n, m, and the voting rule in use. 

A standard result [Canetti et ah, 1995] shows that solving the above prob¬ 
lem for the majority rule on 2 candidates requires at least 12(l/e^ log 1/^) sam¬ 
ples (Theorem 2). Also, a straightforward argument (Theorem 4) using Chernoff 
bounds shows that for any homogeneous voting rule, the sample complexity is 
at most Oipn'f je^ ■ log(m!/(5)). So, when m is a constant, the sample complexity 
is of the order 0(l/£^ log 1/(5) for any homogeneous voting rule that reduces to 
majority on 2 candidates (as is the case for all rules commonly used). Note that 
this bound is independent of n if e and (5 are constants, for any reasonable voting 
rule! 

Our main technical contribution is in understanding the dependence of the 
sample complexity on m, the number of candidates. Note that the upper bound 
cited above has very bad dependence on m and is clearly unsatisfactory in situ¬ 
ations when m is large (such as in online surveys about restaurants). 

- We show that the sample complexity of the (e, (5)-winner determination prob¬ 
lem is 6*(^ log j-) for the fc-approval voting rule when k = o{m) (Theorem 7) 
and the plurality with runoff voting rule (Theorem 11). In particular, for the 
plurality rule, the sample complexity is independent of m as well as nl 

- We show that the sample complexity of the (e, (5)-winner determination prob¬ 
lem is Q( '°s(™/^) ^ and (X _ (5))for the fc-approval voting rule when 

^ Throughout this section, we use standard terminlogy from voting theory. For formal 
definitions, refer to Section 2. 
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k = cm with 0 < c < 1 (Theorem 6), Borda (Theorem 3), approval (The¬ 
orem 5), maximin (Theorem 8), and Bucklin (Theorem 10) voting rules. 
Note that when 5 is a constant, the upper and lower bounds match up to 
constants. We observe a surprising jump in the sample complexity of the 
(e, (5)-winner determination problem by a factor of logm for the fc-approval 
voting rule as k varies from o(m) to cm with c € (0,1). 

1 3 m 

- We show a sample complexity upper bound of for the {e,S)- 

winner determination problem for the Copeland“ voting rule (Theorem 9) 
and 0{ ^ {m+iog STV voting rule (Theorem 12). 

We summarize the results in Table 1. 


Voting Rule 

Sample complexity 

fc-approval 

0(^ log 1)Theorem 7 

12(i°g('f,+i). (1 <5))Theorem 3 

Scoring Rules 

O(i^) Theorem 6 

^( 1 ^ (^_^))t Theorems 

Borda 

Approval 

O(i^) Theorem 5 

Maximin 

O(i^) Theorem 8 

Copeland 

Q(iog^) Theorem 9 

Bucklin 

0(l25^) Theorem 10 

Plurality with runoff 

(Theorem 11 

12(^logi)* Corollary 1 

STV 

0(^'(^+i°"^) (Theorem 12 

Any homogeneous voting rule 

^(miMog^ (Theorem 4 


Table 1. Sample complexity of the (e, 5)-winner determination problem for various 
voting rules. f-The lower bound of Q{ . (1 — <5)) also applies to any voting rule 
that is Condorcet consistent. The lower bound of log |) holds for any voting 
rule that reduces to the plurality voting rule for elections with two candidates. 


The rest of the paper is organized as follow. We introduce the terminologies 
and define the problem formally in Section 2; we present the results on lower 
bounds in Section 3; Section 4 contains the results on the upper bounds for 
various voting rules; finally, we conclude in Section 5. 

This paper is a significant extension of the conference version of this 
work Dey and Bhattacharyya [2015]: this extended version includes all the 
proofs. 
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1.2 Related Work 

The subject of voting is at the heart of (computational) social choice the¬ 
ory, and there is a vast amount of literature in this area. Elections take 
place not only in human societies but also in manmade social networks 
[Boldi et ah, 2009, Rodriguez et ah, 2007] and, generally, in many multiagent 
systems [Ephrati and Rosenschein, 1991, Pennock et ah, 2000]. The winner de¬ 
termination problem is the task of finding the winner in an election, given the 
voting rule in use and the set of all votes cast. It is known that there are natural 
voting rules, e.g., Kemeny’s rule and Dodgson’s method, for which the winner de¬ 
termination problem is NP-hard (Bartholdi III et ah, 1989, Hemaspaandra et ah, 
1997, 2005]. 

The general question of whether the outcome of an election can be deter¬ 
mined by less than the full set of votes is the subject of preference elicitation^ 
a central category of problems in AI. The (e, (5)-winner determination problem 
also falls in this area when the elections are restricted to those having margin 
of victory at least en. For general elections, the preference elicitation problem 
was studied by Conitzer and Sandholm [Conitzer and Sandholm, 2002], who de¬ 
fined an elicitation policy as an adaptive sequence of questions posed to voters. 
They proved that hnding an efficient elicitation policy is N P-hard for many com¬ 
mon voting rules. Nevertheless, several elicitation policies have been developed 
in later work [Conitzer, 2009, Ding and Lin, 2012, Lu and Boutilier, 2011a,b, 
Oren et ah, 2013] that work well in practice and have formal guarantees under 
various assumptions on the vote distribution. Another related work is that of 
Dhamal and Narahari [Dhamal and Narahari, 2013] who show that if the voters 
are members of a social network where neighbors in the network have similar 
candidate votes, then it is possible to elicit the votes of only a few voters to 
determine the outcome of the full election. 

In contrast, in our work, we posit no assumption on the vote distribution 
other than that the votes create a substantial margin of victory for the winner. 
Under this assumption, we show that even for voting rules in which winner de¬ 
termination is N P-hard in the worst case, it is possible to sample a small number 
of votes to determine the winner. Our work falls inside the larger framework of 
property testing [Ron, 2001], a class of problems studied in theoretical computer 
science, where the inputs are promised to either satisfy some property or have 
a “gap” from instances satisfying the property. In our case, the instances are 
elections which either have some candidate w as the winner or are “far” from 
having w being the winner (in the sense that many votes need to be changed). 

The basic model of elections has been generalized in several other ways to 
capture real world situations. One important consideration is that the votes may 
be incomplete rankings of the candidates and not a complete ranking. There 
can also be uncertainty over which voters and/or candidates will eventually 
turn up. The uncertainty may additionally come up from the voting rule that 
will be used eventually to select the winner. In these incomplete information 
settings, several winner models have been proposed, for example, robust win¬ 
ner [Boutilier et ah, 2014, Lu and Boutilier, 2011a, Shiryaev et ah, 2013], multi 
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winner [Lu and Boutilier, 2013], stable winner [Falik et al., 2012], approximate 
winner [Doucette et al., 2014], probabilistic winner [Bachrach et al., 2010]. Ha- 
zon et al. [Hazon et al., 2008] proposed useful methods to evaluate the outcome 
of an election under various uncertainties. We do not study the role of uncer¬ 
tainty in our work. 

Organization We formally introduce the terminologies in Section 2; we present 
the results on lower bounds in Section 3; Section 4 contains the results on the 
upper bounds for various voting rules; finally, we conclude in Section 5. 

2 Preliminaries 

2.1 Voting and Voting Rules 

Let V = {ui,..., Vn} be the set of all voters and C = {ci,..., Cm} the set of all 
candidates. Each voter ViS vote is a complete order over >-i over the candidates 
C. For example, for two candidates a and b, a b means that the voter Vi 
prefers a to b. We denote the set of all complete orders over C by £(C). Hence, 
£(C)" denotes the set of all n-voters’ preference profiles 

A map r : W„,|C|eN+'^(l^)" —^ ^ is called a voting rule. Given a vote profile 
/1(C)"’, we call r{y) the winner. Note that in this paper, each election has a 
unique winner, and we ignore the possibility of ties. A voting rule is called homo¬ 
geneous if it selects the winner solely based on the fraction of times each complete 
order from C(C) appears as a vote in the election. All the commonly used voting 
rules including the ones that are studied in this paper are homogeneous. 

Given an election E, we can construct a weighted graph Ge called weighted 
majority graph from E. The set of vertices in Ge is the set of candidates 
in E. For any two candidates x and y, the weight on the edge {x, y) is 
DE{x,y) = NE{x,y) - NE{y,x), where V£;(a;,y)(respectively NE{y,x)) is the 
number of voters who prefer x to y (respectively y to x). A candidate x is called 
the Condorcet winner in an election E if //^(a;, y) > 0 for every other candidate 
y ^ X. A voting rule is called Condorcet consistent if it selects the Condorcet 
winner as the winner of the election whenever it exists. 

Some examples of common voting rules^ are: 

Positional scoring rules: A collection of m-dimensional vectors Sm = 
(oi, 02 , ■ • ■, oim) G R™ with Qfi > 02 > • • • > Orra and oi > am for every 
m G N naturally defines a voting rule - a candidate gets score from a vote 
if it is placed at the i*^ position, and the score of a candidate is the sum of 
the scores it receives from all the votes. The winner is the candidate with 
maximum score. 

Without loss of generality, we assume that for any score vector a, there 
exists a j such that aj = 1 and Ofc = 0 for all k > j. The vector a that is 

^ In all these rules, the possibilities of ties exist. If they do happen, we assume that 
some arbitrary but fixed tie breaking rule is applied. 
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1 in the first k coordinates and 0 otherwise gives the k-approval voting rule. 
1-approval is called the plurality voting rule, and (m — l)-approval is called 
the veto voting rule. The score vector (m— 1, m — 2,..., 1,0) gives the Borda 
voting rule. 

- Approval: In approval voting, each voter approves a subset of candidates. 
The winner is the candidate which is approved by the maximum number of 
voters. 

Maximin: The maximin score of a candidate x is min^^a, D{x, y). The win¬ 
ner is the candidate with maximum maximin score. 

Copeland": The Copeland" score of a candidate a: is \{y ^ x : (x, y) > 

0}| -|-a|{y 7 ^ X : D£{x, y) = 0}|, where a £ [0,1]. The winner is the candidate 
with the maximum Copeland score. 

Bucklin: A candidate x’s Bucklin score is the minimum number I such that 
more than half of the voters rank x in their top I positions. The winner is 
the candidate with lowest Bucklin score. 

- Plurality -with runoff: The top two candidates according to plurality score 
are selected first. The pairwise winner of these two candidates is selected as 
the winner of the election. This rule is often called the runoff YOting rule. 

- Single Transferable Vote: In Single Transferable Vote (STV), a candi¬ 
date with least plurality score is dropped out of the election and its votes 
are transferred to the next preferred candidate. If two or more candidates 
receive least plurality score, then tie breaking rule is used. The candidate 
that remains after (m — 1) rounds is the winner. 

Among the above voting rules, only the maximin and the Copeland voting 
rules are Condorcet consistent. 

Given an election, the margin of victory of this election is: 

Definition 2. Given a voting profile y, the margin of victory (MOV) is the 
smallest number of votes k such that the winner can he changed by changing k 
many votes in )^, while keeping other votes unchanged. 

Xia [Xia, 2012] showed that for most common voting rules (including all 
those mentioned above), when each voter votes i.i.d. according to a distribution 
on the candidates, the margin of victory is with high probability, either 0{^/n) 
or 0(n). 


2.2 Statistical Distance Measures 

Given a finite set X, a distribution y on X is defined as a function y : X —> 
[0,1], such that = 1- The finite set X is called the base set of the 

distribution y. We use the following distance measures among distributions in 
our work. 

Definition 3. The KL divergence [Kullback and Leibler, 1951] and the Jensen- 
Shannon divergence [Lin, 1991] between two distributions yi and y 2 on X are 
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defined as follows. 

DKL{f^i\\f^ 2 ) = ^ All (a:) log 

JS{ni,H2) = ^ (dkl +Dkl 2 

The Jensen-Shannon divergence has subsequently been generalized to mea¬ 
sure the mutual distance among more than two distributions as follows. 

Definition 4. Given n distributions fii, ..., over the same base set, the gen¬ 
eralized Jensen-Shannon divergence^ among them is: 

1 " [ 1 " \ 

, /i„) = — ^ ^ I 11 ~ ^ ^ 1 

i=i \ ” i=i / 


2.3 ChernofT Bound 

We repeatedly use the following concentration inequality: 

Theorem 1. Let Xi,..., be a sequence of i independent random variables 
in [0,1] (not necessarily identical). Let S = Xi and let fi = ¥, [S']. Then, for 
any 0 < J < 1; 

Pr[|S — pl\ > Si] < 2 exp{—2£S^) 

and 

Pr[|S — fi\> SpL] < 2 exp(— 

The first inequality is called an additive bound and the second multiplicative. 

3 Results on Lower Bounds 

Our lower bounds for the sample complexity of (e, J)-winner determination are 
derived from information-theoretic lower bounds for distinguishing distributions. 

We start from the following basic observation. Let X be a random variable 
taking value 1 with probability \ — £ and 0 with probability t -be; F be a random 
variable taking value 1 with probability | -b e and 0 with probability | — e. 
Then, it is well-known that every algorithm needs log j-) many samples to 
distinguish between X and Y with probability of making an error being at most 
5 [Bar-Yossef et ah, 2001, Canetti et ah, 1995]. Immediately, we have: 

Theorem 2. The sample complexity of the {e,d)-winner determination problem 
for the plurality voting rule is log |). 

® The generalized Jensen-Shannon divergence is often formulated with weights on each 
of the n distributions. The definition here puts equal weight on each distribution and 
is sufficient for our purposes. 







3. RESULTS ON LOWER BOUNDS 


Proof. Consider an election with two candidates a and b. Consider two vote 
distributions X and Y. In X, exactly ^ + e fraction of voters prefer a to b and 
thus a is the plurality winner of the election. In Y, exactly 5 + e fraction of voters 
prefer 6 to a and thus b is the plurality winner of the election. Also, the margin 
of victory of both the elections corresponding to the vote distributions X and Y 
is £71, since each vote change can change the plurality score of any candidate by 
at most one. Any (e, i5)-winner determination algorithm for plurality will give us 
a distinguisher between the distributions X and Y with probability of error at 
most S and hence will need log |) samples. □ 

Theorem 2 immediately gives us the following corollary. 

Corollary 1. Every {e,S)-winner determination algorithm needs l7(^log|) 
many samples for any voting rule which reduces to the plurality rule for two 
candidates. In particular, the lower bound holds for approval, scoring rules, max- 
imin, Copeland, Bucklin, plurality with runoff, and STV voting rules. 

Proof. All the voting rules mentioned in the statement except the approval vot¬ 
ing rule is same as the plurality voting rule for elections with two candidates. 
Hence, the result follows immediately from Theorem 2 for the above voting rules 
except the approval voting rule. The result for the approval voting rule follows 
from the fact that any arbitrary plurality election is also a valid approval election 
where every voter approves exactly one candidate. □ 

We derive stronger lower bounds in terms of m by explicitly viewing the 
(e, ^)-winner determination problem as a statistical classification problem. In 
this problem, we are given a black box that contains a distribution p which 
is guaranteed to be one of i known distributions pi,... ,pi. A classifier is a 
randomized oracle which has to determine the identity of p, where each oracle call 
produces a sample from p. At the end of its execution, the classifier announces a 
guess for the identity of p, which has to be correct with probability at least 1 — S. 
Using information-theoretic methods, Bar-Yossef [Bar-Yossef, 2003] showed the 
following: 


Lemma 1. The worst case sample complexity q of a classifier C for pi,..., pi 
which does not make error with probability more than S satisfies following. 


q> 12 


log£ 


JS {pi, ...,pi) 


.{ 1 - 8 ) 


The connection with our problem is the following. A set U of n votes on a 
candidate set C generates a probability distribution pv on C{C), where pv{>~) 
is proportional to the number of voters who voted Querying a random vote 
from V is then equivalent to sampling from the distribution pv. The margin of 
victory is proportional to the minimum statistical distance between pv and pw, 
over all the voting profiles W having a different winner than the winner of V. 

Now, suppose we have m voting profiles Vi,..., Vm having different winners 
such that each Vi has margin of victory at least en. Any {e, (5)-winner determina¬ 
tion algorithm must also be a statistical classiher for pv ^, ■ • ■, PVm the above 
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sense. It then remains to construct such voting profiles for various voting rules 
which we do in the proof of the following theorem: 


Theorems. Every (e,S)-winner determination algorithm needs 
O ^ . (1 — S)'j many samples for approval, Borda, Bucklin, and any 

Condorcet consistent voting rules, and Q (1 — 6)^ many samples for the 

k-approval voting rule. 


Proof. For each voting rules mentioned in the theorem, we will show d {d = k + 1 
for the fc-approval voting rule and d = m for the rest of the voting rules) many 
distributions on the votes with the following properties the result 

follows from Lemma 1. Let Vi be an election where each vote v G C{C) occurs 
exactly yii{v) ■ n many times. Let tt- = 2 Ti- 

1. For every i ^ j, the winner in Vi is different from the winner in Vj. 

2. For every i, the margin of victory of 14 is !7(en). 

3 . DKMId) = 0(e^) 

The distributions for different voting rules are as follows. Let the candidate set 
be C = {ci,.. .,Cm}. 

fc-approval voting rule. Fix any arbitrary M ■.= k + \ many candidates 
Cl,..., Cm- For i G [M], we define a distribution Hi on all fc sized subsets of C 
(for the fc-approval voting rule, each vote is a fc-sized subset of C) as follows. 
Each fc sized subset corresponds to top fc candidates in a vote. 




if Ci € a: and x C {ci,..., cm} 
Ci ^ X and x C {ci,..., cm} 
else 


1—e 


The score of Ci in I 4 is n ( e -f (1 — e) J i score of any other candidate 

Cj G {ci,..., Cm} \ {ci} is n (1 — e) ^ , and the score of the rest of the can- 

l fc J 

didates is zero. Hence, the margin of victory is L2{en), since each vote change 
can reduce the score of Ci by at most one and increase the score of any other 
candidate by at most one. This proves the result for the fc-approval voting rule. 
Now, we show that DKL{di\\d) be O(e^). 


M 


DKLid^Wd) = { ^ + (1 - e) ) log ( 1 - e + e— ) + (1 - ^) ( 1 - TF ) 


fc 


M 


M 


M 


^ ^ + (1 - £) 77 UtT - £ - (1 - ^) 1 - 77 


M 




V k 


< 2e^ 
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Approval voting rule. The result follows from the fact that every 
approval election is also a valid approval election and Lemma 2. 

Borda, any Condorcet consistent voting rule. The score vector for the 
Borda voting rule which we use in this proof is (to, to — 1,..., 1). For i G [to], 
we define a distribution on all possible linear orders over C as follows. 





if Ci is within top ^ positions in x. 
else 


The score of Ci in Vi is ^(1 + §) whereas the score of any other candidate 
Cj ^ Ci is Hence, the margin of victory is at least since each vote change 
can reduce the score of Ci by at most to and increase the score of any other 
candidate by at most to. Also, in the weighted majority graph for the election 
Vj, w{ci,Cj) = Hence, the margin of victory is at least since each vote 
change can change the weight of any edge in the weighted majority graph by at 
most two. Now, we show that DKL{fJ-i\\fJ-) to be 0{e^). 




< 


i^log(l+£) + l^log(l-£) 

1 e 1 — e 

—^- —£ 


= € 


2 


Bucklin. For i G [m], we define a distribution fii on all ^ sized subsets of C 
as follows. Each ^ sized subset corresponds to the top ^ candidates in a vote. 


fii{x) 


1 —e 




if Ci € a; 
else 


The candidate a occurs within the top ^ positions at least n(l — many 
times, and any candidate Cj ^ Ci occurs within the top ^ positions at most 
^ ^ many times. Hence, the margin of victory is at least since each vote 

change can change the number of time any particular candidate occurs within 
top ^ positions by at most one. Now, we show that DxLiTiWfJ-) to be 0{e^). 


DKL{n^\\^^) = (^1 - log(4- 3e) + ^loge 

< (l - ^) log (4 - 3e) 

= 2^2 


□ 
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4 Results on Upper Bounds 


In this section, we present the upper bounds on the sample complexity of the 
(e, J)-winner determination problem for various voting rules. The general frame¬ 
work for proving the upper bounds is as follows. For each voting rule, we first 
prove a useful structural property about the election when the margin of victory 
is known to be at least en. Then, we sample a few votes uniformly at random to 
estimate either the score of the candidates for score based voting rules or weights 
of the edges in the weighted majority graph for other voting rules. Finally, ap¬ 
pealing to the structural property that has been established, we argue that, the 
winner of the election on the sampled votes will be the same as the winner of 
the election, if we are able to estimate either the scores of the candidates or 
the weights of the edges in the weighted majority graph to a certain level of 
accuracy. 

Before getting into specific voting rules, we prove a straightforward bound 
on the sample complexity for the (e, (5)-winner determination problem for any 
voting rule. 


Theorem 4. There is a {e,S)-winner determination algorithm for every homo¬ 
geneous voting rules with sample complexity 0( ™' ^ ). 


Proof. We sample i votes uniformly at random from the set of votes with re¬ 
placement. Let Xi be an indicator random variable that is 1 exactly when x 
is the i’th sample, and let g{x) be the total number of voters whose vote is x. 
Define g{x) = j J2i=i Using the Chernoff bound (Theorem 1), we have the 
following: 


Pr 


\g{x)-g{x)\ > 


en 

2m!- 


< 2 ■exp I — 


e^£ \ 

'2m!2 J 


By using the union bound, we have the following. 


Pr 


3a: e £(C), \g{x) - gix)\ > 


en 

2m! 


< 2m! ■ exp 



Since the margin of victory is en and the voting rule is anonymous, the winner of 
the i sample votes will be same as the winner of the election if \g{x) — g{x)\ < 
for every linear order x G £{C). Hence, it is enough to take £ = Oim}?je^ ■ 
log(m!/(5)). □ 


4.1 Approval Voting Rule 

We derive the upper bound on the sample complexity for the (e, ^)-winner de¬ 
termination problem for the approval voting rule. 

Lemma 2. If MOV > en and w be the winner of a approval election, then, 
s{w) — s(x) > en, for every candidate x ^ w, where s{y) is the number of 
approvals that a candidate y receives. 
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Proof. Suppose there is a candidate x ^ w such that s{w) — s(x) < en. Then 
there must exist en — 1 votes which does not approve the candidate x. We modify 
these votes to make it approve x. This makes w not the unique winner in the 
modified election. This contradicts the fact that the MOV is at least en. □ 

Theorem 5. There is a {e,d)-winner determination algorithm for the approval 
voting rule with sample complexity Q( ^°g(™/^) 

Proof. Suppose w is the winner. We sample i votes uniformly at random from 
the set of votes with replacement. For a candidate x, let Xi be a random variable 
indicating whether the Fth vote sampled approved x. Define s{x) = j X]i=i 
Then, by an argument analogous to the proof of Theorem 4, Pr[3a; G C, |s(a;) — 
s(a;)| > en/2] < 2m ■ exp (—e^£/2). Thus since MOV> en and by Lemma 2, if 
we take £ = s{w) is greater than s{x) for all x ^ w. □ 


4.2 Scoring Rules 

Now, we move on to the scoring rules. Again, we first establish a structural 
consequence of having large MOV. 

Lemma 3. Suppose a = (oi, ..., am) be a normalized score vector and w is the 
winner of an election using scoring rule a with MOV > en. Then, s(w) — s{x) > 
a\enj2 for every candidate x ^ w, where s{w) and s{x) denote the score of the 
candidates w and x respectively. 

Proof. There must be at least en many votes where w is preferred over x, since 
we can make x win the election by exchanging the positions of x and w in all 
these votes and MOV > en. Let u be a vote where w is preferred to x. Suppose 
we replace the vote v by another vote v' = x >■ others w. We claim that 
this replacement reduces the current value of s[w) — s(x) by at least ai. If we 
change en/2 such votes, then s{w) — s{x) decreases by at least aien/2 but, at 
the same time, w must still be the winner after the vote changes because of the 
MOV condition. So, s{w) — s(a;) > aien/2. 

To prove the claim, suppose w and x were receiving a score of ai and aj 
respectively from the vote v. By replacing the vote v by u', the current value of 
s{w) — s(x) reduces by ai — aj + at, since am = 0. Now, ai — aj + ai > ai 
since in the vote v, the candidate w is preferred over x and hence, aj < ai. This 
proves the result. □ 

Theorem 6. Suppose a = {ai,... ,am) be a normalized score vector. There 
is a (e,S)-winner determination algorithm for the a-scoring rule with sample 
complexity 

Proof. It is enough to show the result for the (2e, d)-winner determination prob¬ 
lem. We sample i votes uniformly at random from the set of votes with replace¬ 
ment. For a candidate x, define Xi = ^ ii x gets a score of ai from the ith sample 
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vote, and let s{x) = Now, using Chernoff bound (Theorem 1), we 

have: 

Pr [|s(a:) — s(x)| > ai£n/4] < 2 exp 

The rest of the proof follows from an argument analogous to the proof of Theo¬ 
rem 5 using Lemma 3. □ 

From Theorem 6 , we have a (sr, (5)-winner determination algorithm for the 
fc-approval voting rule which needs 0{ ^ samples for any k. This is 

tight by Theorem 3 when k = cm for some c € (0,1). 

When k = o{m), we have a lower bound of 17 logy) for the fc-approval 
voting rule (see Corollary 1). We show next that this lower bound is also tight 
for the fc-approval voting rule when fc = o(m). Before embarking on the proof of 
the above fact, we prove the following lemma which will be crucially used. 

Lemma 4. Let f : R —> 'R be a function defined by fix) = e“ ® . Then, 

fix) + f{y) < fix + v), for x,y>Q, >2,x <y 

x + y 

Proof. For the function fix), we have following. 

fix) = 

^ 

^ 

Hence, for x,y > 0, > 2,x < y we have f"ix),f"iy),f"{x + y) > 0. This 

implies following for x < y and an infinitesimal positive S. 

fix) < fiy) 

fjx-S) - fix) fjy) - fjy-S) 

s — s 

fix) + fiy) < fix -S) + fiy + 6) 

=> fix) + fiy) < fix + y) 




□ 

Theorem 7. There is a ie, 6)-winner determination algorithm for the fc- 

loe:f — ^ 

approval voting rule with sample complexity 0(—j^)- 

Proof. It is enough to show the result for the (2e, d)-winner determination prob¬ 
lem. We sample i votes uniformly at random from the set of votes with replace¬ 
ment. For a candidate x, let Xi be a random variable indicating whether x is 
among the top fc candidates for the vote sample. Define six) = 
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and let s(a:) be the actual score of x. Then by the multiplicative Chernoff bound 
(Theorem 1), we have: 


Pr [|s(a;) — s(a:)| > en] < 2 exp 

By union bound, we have the following, 

Pr[3x € C, |s(a:) — s(a:)| > en] 



< 2k exp (—er^^/S) 


Let the candidate w be the winner of the election. The second inequality in the 
above derivation follows from the fact that, the function Gcexp(-I^) is 
maximized in the domain, defined by the constraint: for every candidate x € C, 
s(x) G [0,n] and setting s(x) = n for every x G C and 

s(y) = 0 for every y G C\C', for any arbitrary subset C' C C of cardinality k 
(due to Lemma 4). The rest of the proof follows by an argument analogous to 
the proof of Theorem 4 using Lemma 3. □ 

Notice that, the sample complexity upper bound in Theorem 7 is indepen¬ 
dent of m for the plurality voting rule. Theorem 7 in turn implies the following 
Corollary which we consider to be of independent interest. 

Corollary 2. There is an algorithm to estimate the ioo norm iooih-) of a dis¬ 
tribution /i within an additive factor of e by querying only O(^logj') many 
samples, if we are allowed to get i.i.d. samples from the distribution p. 

Such a statement seems to be folklore in the statistics community 
[Dvoretzky et ah, 1956]. Recently in an independent and nearly simultaneous 
work, Waggoner [Waggoner, 2015] obtained a sharp bound of 4-log(y) for the 
sample complexity in Corollary 2. 


/ e'^ln \ 

V 3s(x)J 


4.3 Maximin Voting Rule 

We now turn our attention to the maximin voting rule. The idea is to sample 
enough numer of votes such that we are able to estimate the weights of the edges 
in the weighted majority graph with certain level of accuracy which in turn leads 
us to predict winner. 

Lemma 5. Suppose MOV > en and w be the winner of a maximin election. 
Then, s(w) — s{x) > en, for every candidate x ^ w, where s(.) is the maximin 
score. 
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Proof. Let w be the winner and x be any arbitrary candidate other than w. 
Suppose, for contradiction, s{w) — s{x) < en. Suppose y be a candidate such 
that N{w, y) = s{w). Now there exist at least en — 1 votes as below. 

We replace en — 1 of such votes by the votes as below. 

Cl>--->y>---->- Cm-2 >■ w 

This makes the maximin score of w less than the maximin score of x. This 
contradicts the assumption that MOV > en. □ 

Theorem 8. There is a [e, 5)-winner determination algorithm for the maximin 
voting rule with sample complexity ). 

Proof Let x and y be any two arbitrary candidates. We sample i votes uniformly 
at random from the set of votes with replacement. Let Xi be a random variable 
defined as follows. 


W = 



if a: y in the i*^ sample 
else 


Define D{x, y) = j We estimate D{x, y) within the closed ball of radius 

en/2 around D{x, y) for every candidates x,y gC and the rest of the proof follows 
from by an argument analogous to the proof of Theorem 5 using Lemma 5. □ 


4.4 Copeland Voting Rule 

Now, we move on to the Copeland" voting rule. The approach is similar to the 
maximin voting rule. However, it turns out that we need to estimate the edge 
weights of the weighted majority graph more accurately for the Copeland" voting 
rule. Xia introduced the brilliant quantity called the relative margin of vietory 
(see Section 5.1 in [Xia, 2012]) which will be used crucially for showing sample 
complexity upper bound for the Copeland" voting rule. Given an election, a 
candidate x G C, and an integer (may be negative also) t, s^(V,x) is defined as 
follows. 


St(y^ x) = \{y G C : y X, D{y, x) < 2t}| + a\{y gC :y^x, D{y, x) = 2t}\ 

For every two distinct candidates x and y, the relative margin of victory, 
denoted by RM(x, y), between x and y is defined as the minimum integer t such 
that, s'_^{V,x) < s^{V,y). Let w be the winner of the election £. We define a 
quantity r{£) to be min 3 ,gc'y{„,}.{i?M(r(;, a;)}. Notice that, given an election £, 
r{£) can be computed in polynomial amount of time. Now we have the following 
lemma. 
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Lemma 6 . Suppose MOV > en and w he the winner of a Copelan(P election. 
Then, RM{w,x) > 2 ([iogml+i) ’ every candidate x ^ w. 

Proof. Follows from Theorem 11 in [Xia, 2012]. 

Theorem 9. There is a {e,S)-winner determination algorithm for CopelandP 

1 3 m 

voting rule with sample complexity 0 (— 

Proof. Let x and y be any two arbitrary candidates and w the Copeland" winner 
of the election. We estimate D{x,y) within the closed ball of radius 5 (pogm]+i) 
around D{x, y) for every candidates x, y G C in a way analogous to the proof of 

Theorem 8. This needs 0{ ^°^^2 ) many samples. The rest of the proof follows 

from Lemma 6 by an argument analogous to the proof of Theorem 4. □ 


4.5 Bucklin Voting Rule 

For the Bucklin voting rule, we will estimate how many times each candidate 
occurs within the first k position for every k G [to]. This eventually leads us to 
predict the winner of the election due to the following lemma. 


Lemma 7. Suppose MOV of a Bucklin election be at least en. Let w be the 
winner of the election and x be any arbitrary candidate other than w. Suppose 


bw 


• r n en , 

mm{i : w is within top i places in at least — I - votes} 


b 


X 


min{i : x is within top i places in at least — 

i 2 


Then, byj < 62 ,. 


en , 
— votes} 

O 


Proof. We prove it by contradiction. So, assume 6^, > b^. Now by changing 
^ votes, we can make the Bucklin score of w to be at least b^. By changing 
another ^ votes, we can make the Bucklin score of x to be at most b^. Hence, 
by changing votes, it is possible not to make w the unique winner which 
contradicts the fact that the MOV is at least en. □ 


Theorem 10. There is a (e, S)-winner determination algorithm for Bucklin vot- 

loe 

ing rule with sample complexity 0{ 

Proof. Let x be any arbitrary candidate and 1 < k < m. We sample I votes 
uniformly at random from the set of votes with replacement. Let Xi be a random 
variable defined as follows. 





if X is within top k places in sample 
else 


Let Sfe(x) be the estimate of the number of times the candidate x has been placed 
within top k positions. That is, Sk{x) = Let Sk{x) be the number 
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of times the candidate x been placed in top k positions. Clearly, = 

j E[Xi] = Sk{x). We estimate Sk{x) within the closed ball of radius en/2 
around Sk{x) for every candidate x € C and every integer k € [to], and the rest 
of the proof follows from by an argument analogous to the proof of Theorem 5 
using Lemma 7. □ 


4.6 Plurality with Runoff Voting Rule 

Now, we move on to the plurality with runoff voting rule. In this case, we first 
estimate the plurality score of each of the candidates. In the next round, we 
estimate the pairwise margin of victory of the two candidates that qualifies to 
the second round. 


Lemma 8. Suppose MOV > en, and w and r be the winner and runner up of a 
plurality with runoff election respectively, and x be any arbitrary candidate other 
than and r. Then, following holds. Let s(.) denote plurality score of candidates. 
Then following holds. 

1. D{w,r) > 2en. 

2. For every candidate x € C \ {w, r}, 2s{w) > s{x) + s{r) + en. 

3. If s{x) > s{r) — then D{w,x) > 

Proof. If the first property does not hold, then by changing en votes, we can 
make r winner. If the second property does not hold, then by changing en votes, 
we can make both x and r qualify to the second round. If the third property does 
not hold, then by changing ^ votes, the candidate x can be sent to the second 
round of the runoff election. By changing another ^ votes, x can be made to 
win the election. This contradicts the MOV assumption. □ 


Theorem 11. There is a (e, d)-winner determination algorithm for the plurality 
with runoff voting rule with sample complexity ). 


Proof. Let x be any arbitrary candidate. We sample I votes uniformly at random 
from the set of votes with replacement. Let, Xi be a random variable defined as 
follows. 


1 , if a: is at first position in the sample 
0 , else 


The estimate of the plurality score of x be s{x). Then s{x) = j X]i=i Let 
s{x) be the actual plurality score of x. Then we have following. 


E[X,] = ^,£;[s(x)] = y = six) 

71 / 


By Chernoff bound, we have the following, 


Pr[|s(a;) — s(a;)| > en] < 


ex.pi{e'^ln/'isix)} 
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By union bound, we have the following, 


Pr[3x G C, |s(a::) 


s(x)| > en] < ^ 

xGC 


2 

exp{e^ln/3s(x)} 


2 

~ expje^Z/S} 


The last line follows from Lemma 4. Notice that, we do not need the random 
variables s(a;) and s(y) to be independent for any two candidates x and y. Hence, 
we can use the same I sample votes to estimate s{x) for every candidate x. 
Now, let y and z be the two candidates that go to the second round. 


Y, = 



iiy)^z in the sample 
else 


The estimate of D{y,z) be D{y,z). Then D{y,z) = Then we have 

following. 


E[Yi] 


Diy,z) 

n 


,E[Diy,z)] 


n 

1 


i 

J2E[Y^=D{y,z) 

2=1 


By Chernoff bound, we have the following, 


PT[\D{y, z) - D{y, z)\ > en] < 


2 

exp{£r^^/3} 


Let A be the event that Va; G C, |s(a;) — s(a;)| < en and \D{y, z) — D{y, z)\ < en. 
Now we have, 

Pr[A] > 1 - + g^p|^2;/3}) 

Since we do not need independence among the random variables s(a), s(&), 
D{w,x), D{y,z) for any candidates a,b,w,x,y, and z, we can use the same 
I sampled votes. Now, from Lemma 8, if |s(a::) — s(x)| < ^ for every candidate x 
and \D{y, z)—D{y, z)\ < ^ for every candidates y and z, then the plurality with 
runoff winner of the sampled votes coincides with the actual runoff winner. The 
above event happens with probability at least 1 — d by choosing an appropriate 

l = 0CY^). □ 


4.7 STV Voting Rule 

Now we move on the STV voting rule. The following lemma provides an upper 
bound on the number of votes that need to be changed to make some arbitrary 
candidate win the election. More specifically, given a sequence of m candidates 
{xi}™ 1 with Xm not being the winner, the lemma below proves an upper bound 
on the number of number of votes that need to be modified such that the can¬ 
didate Xi gets eliminated at the round in the STV voting rule. 
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Lemma 9. Suppose V he a set of votes and w be the winner of a STV election. 
Consider the following chain with candidates xi ^ X 2 ^ Xm and Xm w. 

CdC\ {xi} D C \ {xi, X2} D ... D {xm} 

Let sy{A,x) he the plurality score of a candidate x when all the votes in V are 
restricted to the set of candidates A C C. Let us define C-i = C \ {xi,... ,Xi} 
and Sy(yl) := iniii 2 ;gA{sv(^; 2 ;)}. Then, we have the following. 

m— 1 

(sv (C-i, Xi+i) - Sy {C-i)) > MOV 

i=0 

Proof. We will show that by changing (^v {C-i,Xi+i) — Sy (C-i)) votes, 

we can make the candidate Xm winner. If xi minimizes sy{C,x) over x € C, 
then we do not change anything and define Vi = V. Otherwise, there exist 
sy(C,xi) — Sy(C) many votes of following type. 

xi oi 02 Om-i,Oi g C,V1 < i < m — 1 


We replace sy(C,Xi) — Sy(C) many votes of the above type by the votes as 
follows. 

oi xi 02 a^-i 

Let us call the new set of votes by Vi. We claim that, sv(C\ 2 :i, x) = svi (C\xi, x) 
for every candidate x G C \ {xi}. Fix any arbitrary candidate x G C \ {xi}. The 
votes in Vi that are same as in V contributes same quantity to both side of the 
equality. Let v he a vote that has been changed as described above. If x = oi 
then, the vote v contributes one to both sides of the equality. If x 7 ^ oi, then 
the vote contributes zero to both sides of the equality. Hence, we have the claim. 
We repeat this process for (m — I) times. Let Vi be the set of votes after the 
candidate xt gets eliminated. Now, in the above argument, by replacing V by 
Vi_i, Vi by Vi, the candidate set C by C \ {xi,..., Xi_i}, and the candidate xi 
by the candidate Xi, we have the following. 

Sy,_i(C_i,x) = Sy,(C_i,x)Vx GC \ {xi, . . . , Xi} 

Hence, we have the following. 

sv{C-i,x) = sVi{C-i,x)'ix gC \ {xi,... ,Xi} 


In the above process, the total number of votes that are changed is 

(C-i,Xi+i) - Sy (C_i)). □ 


Theorem 12. There is a {e,d)-winner determination algorithm for the STV 
voting rule with sample complexity 0( ™ 
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Proof. We sample I votes uniformly at random from the set of votes with re¬ 
placement and output the STV winner of those I votes say w' as the winner of 
the election. Let, w be the winner of the election. We will show that there exist 
I = 0( ™ 7^ ) for which w = w' with probability at least 1 — <5. Let A be 

an arbitrary subset of candidates and x be any candidate in A. Let us define a 
random variables W, 1 < * < ^ as follows. 


X, = 



if X is at top sample when restricted to A 
else 


Define another random variable sv[A^x) := Then we have, 

E[sv{A,x)\ = sv(^,a;). Now, using Chernoff bound, we have the following, 

STl 2 

Pr[|sv(A,a;) - sv(^,a;)| > —] < - , , 2 , , 

Let E be the event that 3A C C and 3x £ A, |sv(^, a;) — sv(al, a:)| > ^. By 
union bound, we have. 


Pr[L;] > 1 


m2™+i 


The rest of the proof follows by an argument analogous to the proof of Theorem 4 
using Lemma 9. □ 


5 Conclusion 

In this work, we introduced the (e, (5)-winner determination problem and showed 
(often tight) bounds for the sample complexity for many common voting rules. 
Besides closing the remaining gaps in the bounds, here are a few open directions 
to pursue in the future: 

- Is there an axiomatic characterization of the voting rules for which the sam¬ 
ple complexity is independent of m and n? We note that a similar problem 
in graph property testing was the subject of intense study [Alon et ah, 2006, 
Borgs et ah, 2006]. 

~ Specifically for scoring rules, is the sample complexity determined by some 
natural property of the score vector, such as its sparsity? 

- Is it worthwhile for the algorithm to elicit only part of the vote from each 
sampled voter instead of the full vote? As mentioned in the Introduction, 
vote elicitation is a well-trodden area, but as far as we know, it has not been 
studied how assuming a margin of victory can change the number of queries. 

- How can knowledge of a social network on the voters be used to minimize 
the number of samples made? Some initial progress in this direction has 
been made by Dhamal and Narahari [Dhamal and Narahari, 2013] and by 
Agrawal and Devanur (private communication). 
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